Bing Yin

CoMem: Context Management with A Decoupled Long-Context Model

May 29, 2026
Via arXiv

Learning with Rare Success but Rich Feedback via Reflection-Enhanced Self-Distillation

May 12, 2026
Via arXiv

Expert Upcycling: Shifting the Compute-Efficient Frontier of Mixture-of-Experts

Apr 21, 2026
Via arXiv

Controllable and Verifiable Tool-Use Data Synthesis for Agentic Reinforcement Learning

Apr 10, 2026
Via arXiv

Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs

Apr 10, 2026
Via arXiv

MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation

Mar 26, 2026
Via arXiv

Training LLMs for Multi-Step Tool Orchestration with Constrained Data Synthesis and Graduated Rewards

Mar 25, 2026
Via arXiv

TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment

Mar 24, 2026
Via arXiv

Stepwise Penalization for Length-Efficient Chain-of-Thought Reasoning

Feb 27, 2026
Via arXiv

HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning

Jan 30, 2026
Via arXiv